The TRGTK's System Description of the PatentMT Task at the NTCIR-10 Workshop

Authors

Hao Xiong

Weihua Luo

Abstract

This paper introduces the TRGTK system for Patent Machine Translation at the NTCIR-10 Workshop. This year we participated in three subtasks: Chinese-English, English-Japanese, and Japanese-English. We submitted the required system results for Intrinsic Evaluation (IE), Patent Examination Evaluation (PEE), Chronological Evaluation (ChE), and Multilingual Evaluation (ME). Unlike last year's strategy, we focus on developing a strong and practical system for large-scale machine translation requirements. We design parallel algorithms for Chinese word segmentation, weight tuning, and translation decoding; in particular, we propose a document-level translation method to improve the translation quality of special terms. Experimental results show that our system reduces training and decoding time while still achieving promising translation results.
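The abstract does not spell out how the document-level method treats special terms. As a rough illustration only, one plausible reading is that a term should receive the same translation throughout a document. The sketch below assumes a simple majority vote over per-sentence choices; the function names and the input format are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def pick_document_level_terms(sentence_choices):
    """Pick one translation per source term for a whole document.

    sentence_choices: list of dicts, one per sentence, mapping a source
    term to the translation chosen for it in that sentence (hypothetical
    input format, assumed for this sketch).
    """
    votes = defaultdict(Counter)
    for choices in sentence_choices:
        for term, translation in choices.items():
            votes[term][translation] += 1
    # Majority vote; ties go to the translation seen first in the document.
    return {term: counts.most_common(1)[0][0] for term, counts in votes.items()}


def enforce_consistency(sentence_choices):
    """Rewrite each sentence's term choices to the document-level winner."""
    chosen = pick_document_level_terms(sentence_choices)
    return [{term: chosen[term] for term in choices} for choices in sentence_choices]


# Toy document: the same patent term is translated inconsistently.
doc = [
    {"权利要求": "claim"},
    {"权利要求": "right requirement", "装置": "device"},
    {"权利要求": "claim"},
]
consistent = enforce_consistency(doc)
```

A majority vote is only the simplest possible choice here; a real system would more likely score candidate translations with model features instead of raw counts.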

Similar resources

Machine Translation System for Patent Documents Combining Rule-based Translation and Statistical Postediting Applied to the NTCIR-10 PatentMT Task

In this article, we describe the system architecture, the preparation of training data, and a discussion of the experimental results of the EIWA group in the NTCIR-10 Patent Translation Task. Our system combines rule-based machine translation and statistical postediting. What is new in our system compared with the NTCIR-9 PatentMT task is an automatic method for selecting from multiple translations...

The NiuTrans Machine Translation System for NTCIR-9 PatentMT

This paper describes the NiuTrans system developed by the Natural Language Processing Lab at Northeastern University for the NTCIR-9 Patent Machine Translation task (NTCIR-9 PatentMT). We present our submissions to the two tracks of NTCIR-9 PatentMT, and show several improvements to our phrase-based Statistical MT engine, including: a hybrid reordering model, large-scale language modeling, and ...

ZZX_MT: the BeiHang MT System for NTCIR-9 PatentMT Task

In this paper, we describe the ZZX_MT machine translation system for the NTCIR-9 Patent Machine Translation Task (PatentMT). We participated in the Chinese-English translation subtask and submitted three results, which correspond to three different models or decoding algorithms respectively. Both of the first two are phrase-based SMT approaches integrating the BTG constraint into reordering models, and...

TSUKU Statistical Machine Translation System for the NTCIR-10 PatentMT Task

This paper describes details of the TSUKU machine translation system in the NTCIR-10 PatentMT task [8]. This system is an implementation of our tree-to-string statistical machine translation model that combines a context-free grammar (CFG) parse tree and a dependency parse tree.

The HDU Discriminative SMT System for Constrained Data PatentMT at NTCIR10

We describe the statistical machine translation (SMT) systems developed at Heidelberg University for the Chinese-to-English and Japanese-to-English PatentMT subtasks at the NTCIR10 workshop. The core system used in both subtasks is a combination of hierarchical phrase-based translation and discriminative training using either large feature sets and ℓ1/ℓ2 regularization (for Japanese-to-English) ...

Publication date: 2013
